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Fixed  State  Utility  Scaling 
Comparison  of  Different  Criteria 

Shin-ichi  Mayekawa 

The  University  of  Iowa 


UTILITY  SCALING 


I.  Introduction 


In  the  Bayesian  view  of  decision  making,  the  posterior  probability 
distribution  of  the  variable  of  interest  and  its  utility  function  are 
the  essential  ingredients  of  a  decision  process.  The  method  used  to 
assess  the  utility  function  installed  in  the  CADA  monitor  (Novick, 
Hamer,  Libby,  Chen,  and  Woodworth,  1980)  is  called  the  fixed-state 
utility  assessment  procedure  (Novick  and  Lindley,  1979)  in  which  the 
investigator  is  asked  to  specify  subjective  indifference  probabilities 
for  gamble  pairs.  The  assessment  of  the  utility  function  or  the 
scaling  of  the  subjective  utility  is  then  done  through  a  least  squares 
procedure  based  on  the  log-odds  transformed  observations  (subjective 
indifference  probabilities). 

In  Novick  and  Lindley  (1979),  the  following  notation  is  used: 

Observations 


Pijk’  1  O’1”--’  N-1 

j  =  1,2,...,  N 

k  =  2,3,. . . ,  N  +  1 


i  <  j  <  k 

where  N  +  2  is  the  number  of  ordered  outcomes,  p. ..  denotes  the 

ijk 

numerical  values  of  the  subjective  probability  with  which  the  subject 
receives  the  outcome  k  in  a  gamble  between  outcome  i  and  outcome  k, 
which  makes  the  gamble  indifferent  to  receiving  the  outcome  j  for  sure. 


Parameters 

If  Ui,  i-0,1,2,...,  N+l ,  uQ  =  0,  uN+1  *  1,  is  the  utility  of  the 
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i-th  outcome,  then  the  true  indifference  probabilities  are 

(1)  Pijk:  =  (v  Ui)/(V  ui)* 

Given  N  observed  indifference  probabilities  the  N  unspecified  utilities 
can  be  calculated  under  certain  conditions.  However,  in  most  situations 
the  observations  contain  errors  and  it  is  therefore  useful  to  obtain 
additional  observations  and  to  attempt  to  fit  a  utility  function. 

In  this  paper  several  different  methods  of  fitting  (scaling)  will 
be  discussed.  Throughout  this  paper,  we  assume  that  the  utilities  are 


monotonic  to  the  outcomes:  that  is,  under  the  appropriate  ordering  of 


the  outcomes,  we  assume  that  if  i < j  then  u^ < u ^ .  We  further  assume 

that  p, ..  's  are  measured  in  the  absolute  scale,  that  is,  p,.  ,'s  are 
ijk  ’  Kijk 

assumed  to  have  a  unique  origin  and  a  unique  unit. 


II.  Least  Squares  Method 

The  simplest  way  to  estimate  the  utilities  is  to  solve  the  least 


squares  problem: 

(2)  Q.  :  =  [  (p  -  P...)2]  min  w.  r .  t .  u' s ,  where  ^  denotes 

ijk  ^  ijk 

summation  over  all  the  observed  p. ’s,  and  p. ..  is  the  function  of 

ijk  *ijk 

the  utilities  as  defined  by  (1).  This  solution,  although  straight 
forward,  lacks  the  consideration  of  the  comparative  magnitude  of  the 
deviations  or  the  relative  weight  of  each  observation.  As  indicated 
by  Novick  and  Lindley  (1979),  the  value  of  p..,  close  to  1/2  seems  to 

X  j  tc 

have  a  relatively  large  magnitude  of  deviations,  that  is,  the  closer 
p^jk's  are  to  1/2,  the  less  reliable  they  are.  Thus,  observed  values 
Pijlc  distant  from  1/2  would  have  little  influence  on  the  solution. 


One  way  to  handle  this  problem  is  to  define  the  least  squares 
criterion  using  transformed  observations.  The  transformation  which 
removes  the  heterogeneity  of  the  deviations  of  the  observations  must 
be  chosen.  Among  the  transformations  which  serve  this  purpose,  we  only 
consider  the  following  two  transformations: 

(3)  fL(x)  =  ln(  x/(l-x))f 


and 

(4)  Fa(x)  =  sin  1(x1/2)  . 

f^  and  f  are  called  log-odds  transformation  and  arcsine  transformation, 
respectively. 

The  least  squares  criterion  defined  on  these  transformed  values  are 

(5)  ^  =  lTk  [  (^L(Pijk)  "  fL(Pijk))2]  ",nin’ 
and 


W  -X  -  fA<fiJk»2)  *  "ln- 

Another  way  to  handle  this  problem  is  to  define  the  weighted 
least  squares  criterion  such  that  the  heterogeneity  of  the  deviations 
may  be  absorbed  in  the  weights.  Because  the  observations  are  assumed  to 
be  less  reliable  around  1/2,  we  may  define  the  weighted  least  squares 
criterion  as  follows: 


(7) 


V  =  X  I*. 

ijk 


ijk(pijk  “  pijk)  1  min* 


where 


(8)  Wijk  "  1/lpijk(1-pijk)]’ 


In  general  the  least  squares  criteria  can  be  written  as 
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where 

W  = 

aijk 

i 

for 

a  = 

2,3,  and  4 

s 

for 

a  - 

1, 

and 

f  (x)  - 

X 

for 

a  = 

1  and  2. 

a 

= 

fL(x) 

for 

a  = 

3, 

= 

fA(x) 

for 

a  = 

4. 

III.  Solution 


Rewriting  (9)  in  a  matrix  form,  we  have 

(10)  Qa:  =  [fa(£)  -  fa(£)],Wa[fa(£)  -  fa(£>]  ->  min, 

where  denoting  the  number  of  the  observations  by  K,  f  (x)  is  the 

— a 

K  x  1  vector  whose  e-th  element  is  f^Cx^,  Wa  is  the  K  x  K  diagonal 
matrix  whose  e-th  diagonal  element  is  w  ,  and  £  and  £  are  the 
K  x  1  vectors  consisting  of  p^^'8  and  p^.^'s,  respectively.  The 
subscript  "e"  stands  for  any  ijk  triad. 

Because  Uq  and  are  fixed,  there  are  only  N  free  values 

of  u  which  must  be  estimated.  The  solution  of  the  problem  is  given 
by  solving 


(ID 


Du 


Du 


-  0 

where  u  is  the  N  x  1  vector  of  the  utility  scale,  i.e. 

u  -  [Uj,  u2,  ....  Uj^]  ', 

_3u  -j 


and 


is  the  K  x  n  matrix  whose  (e,c)  element  is 


3f  (p  )  3f  (p,  .. ) 
a  e_  _  a  ^ijk 


3u  3u 

c  c 


[The  weight  matrix  is  treated  as  a  constant  even  when  a  =  1.] 


Specifically,  in  a  scalar  form  3f  (p..,)  can  be  written  as  follows 

a  ilk 


3u 


(12)  For  a  =  1  and  2, 


3u 


apiit 

3u 

c 


r-(uk-u.)/(uk-u.)2 


l/(uk-u.) 


|-(jj-u1)/(uk-ui)' 


if  c  =  i 
if  c  =  j 
if  c  =  k. 


for  a  =  3, 


3fa<Pi,1k> 


8u 


'(pijk+ '-fijk) 


3p 


ilk 


ijk 


3u 


and  for  a  =  4, 


3fa(W 

3ur 


afiik 


W^jk'^iik'  s“< 

Among  the  various  numerical  methods  for  solving  (11),  the 
Gauss-Newton  method  seems  to  be  appropriate  here  because  it  does 
not  require  the  evaluation  of  the  second  derivatives  and  yet  is  as 
efficient  as  the  Newton-Raphson  method.  The  direction  for  search  in 
the  Gauss-Newton  method  is  defined  by 


(13)  S  (u)  =  - 


3f 

— c 

i  3t 


(£> 


W 


a  f  <£> 


-1  3Q 


3u 


3ui  /  a  \  du 

The  Gauss-Newton  method  with  the  step-size  halving  is  always 
convergent.  However,  we  must  consider  the  monotonicity  restrictions 
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Usually,  the  restrictions  can  be  enforced  by  setting  the  parameters 
which  fall  out  of  the  proper  region  equal  to  the  boundary  surface. 
However,  we  cannot  apply  this  simple  method  because  our  restrictions 
do  not  include  the  equalities:  that  is,  u^  must  be  strictly  less 
than  u^+^.  To  enforce  the  monotonicity  restrictions,  therefore,  we 
modified  the  step-size  halving  procedure  of  the  Gauss-Newton  method 
so  that  all  the  updated  parameters  may  satisfy  the  restrictions  in 
each  iteration.  As  long  as  the  initial  estimates  satisfy  the  restric¬ 
tions,  this  method  is  guaranteed  to  converge  to  the  restricted  minimum. 

For  the  weighted  least  squares  case  (a  =  1),  the  W  matrix  must 
be  successively  updated  in  each  iteration  by 


(14)  wlijk  =  1/Pijk(l  -  P 
using  the  latest  value  of 


IV .  Maximum  Likelihood  Method  Based  on  the  Normal  Distribution 

In  the  previous  sections,  we  used  the  word  "relative  magnitudes 
of  the  deviations"  without  introducing  the  statistical  concept.  In 
this  section,  however,  we  assume  that  the  observations  are  the  random 


variables.  Under  the  assumption  that  fa(p  has  the  normal  distri- 

-  2 

button  with  the  mean  f  (p  .  )  and  the  constant  variance 

3  lj  K 


(15)  fa(Pijk)%  "‘VW’  ° 


for  a=2 , 3,  and  4, 

or  under  the  assumption  that  Pjj^  has  t*,e  normal  distribution 
the  mean  Pijk  and  the  variance  cpljk  (1-p  ),  where  c  is  the 


with 

unknown 


parameter  to  be  estimated. 
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(16)  p.jk„  N(pijk,  cpijk  (1-Pijk)), 

it  can  be  shown  that  the  least  squares  solutions  described  in  the 
previous  sections  coincide  with  the  maximum  likelihood  solutions. 
That  is,  the  derivatives  of  the  log  likelihood  function  for  the 
assumption  (15) , 


(17) 


it  N  .  .  N  ,  2 

InL  =  ~  2  ln(2ir  )  -  -j  In  a 


2  a  ljk 


^Pij^  -  fa(pijk)r 


for  a  =  2,3,  and  4, 

with  respect  to  a ^  and  u^'s  result  in  the  equations 

<18>  °2 ' 
and 

<19>  -iiak  .  I  «a<pljk>  -  f>ljk» 

3u  2  ilk  3u 

c  a  J  c 

and  for  the  assumption  (16) ,  the  derivatives  of  the  log  likelihood 

function 


N 


(20)  InL  =  -  j  ln(2 


w-i 


2  2 
Inc  d  . 

ijk 


1  y  (pijk~  pijk)“ 

2  ijk  2  2 

ijk 


where 


d2ijk '  p«k 

2 

with  respect  to  c  and  uc's  result  in  the  equations 


(21)  c 


2  -  I  X  (plik  ~  p i j k^  2 

-  N  ijk  Pijk(l  -  Pljk) 


8 


Pjjk 
3u 

c 

These  equations  can  be  written  as  (11)  using  the  same  notation,  and 
the  Gauss-Newton  method  is  also  appropriate  as  the  *umerical  method. 

One  of  the  merits  of  the  maximum  likelihood  solution  is  the 
inverse  matrix  of 


and 

(22) 


31nL 


i  X 

c2'  iJk 


(pijk  ~  Pj.jk^ 


(23)  I  (u):  = 
a  — 


a4  (£)1 

3  u 


3  u 


which  appeared  in  (12),  can  provide  the  asymptotic  variance/covariance 
matrix  of  the  estimated  parameters  (see  Jennrich  and  Moor,  1975). 

From  the  Bayesian  point  of  view  this  is  equivalent  to  saying  that 
with  the  uniform  prior  distribution,  the  posterior  distribution  of 
the  parameter  u  is  approximately  multivariate  normal  with  the  mean 
vector  ii  ,  the  maximum  likelihood  estimates,  and  the  variance/covariance 
matrix  I^(u)  (See  Lindley  1965).  That  is,  using  the  approximate 
posterior  distributions,  we  car.  draw  rough  inferences  concerning  the 
imprecision  of  the  estimated  utility  scale.  For  example,  we  can  draw 
the  approximate  95  percent  HDR  of  the  utility  function  using  the 
posterior  marginal  distributions. 


V.  Maximum  Likelihood  Method  Based  on  the  Beta  Distribution 

Under  the  assumption  that  each  observation  has  the  beta  distribution 
with  parameters  ap^^+b  and  a(l-p^k)+b. 


(24)  pijk. 


6<'apijk+b’  a^1_Pijk)+b^  ’ 
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where  a  i.s  the  unknown  parameter  related  to  the  variance  and  b  is  the 
constant  related  to  the  location,  the  solution  which  maximizes 


(25)  lnL:=  In 
ijk 


ap.  +b  , .  a(  l-p .  )  +  b 
Pijk  ljk  (I  ~  pijk)  ljk 

L  BtaPijk  +  b»  a(1_Pijic)  +  b] 


where  B[a,6]  denotes  the  beta  function  with  the  arguments  a  and  B  , 

was  proposed  by  Mayekawa  (1980).  In  this  formulation,  the  variance 

of  p...  can  be  written  as 
ijk 


(26) 


(aPijk  +  b)  (a(1  ~  Pijk}  +  b) 

(a  +  2b)2  (a  +  2b  +1) 


which  shows  that  this  assumption  is  compatible  with  the  previous 
consideration  of  the  errors. 

In  this  section  we  will  consider  the  two  specific  cases  in  which 

b  is  set  equal  to  zero  and  one,  respectively.  When  b  is  set  equal  to 

zero,  the  mean  of  the  distribution  is  equal  to  p...  and  when  b  is  set 

ijk 

equal  to  one,  the  mode  of  the  distribution  is  equal  to  p_k> 

Fischer's  information  matrix  under  this  assumui''  n  is  given 


ln+i,  n+i  /*~  tp 
ijk 


[ y  (ap„k+b)  +  ^  ^a^-Pijk^  +b^  » 

[  l^'(aPijk+  b)  +  VV(a(l-Pijk)  +b>h 


/(a  P..k  +  b)  +  (1-Pijk)  CV(a(l-p..k)  +  b) 
(a  +  2b)], 
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and 


IH,N+1  IN+1,«, 


Z  +  b) 


Ip  (a(l  -  Pijk)  +  b)]  , 


where 


*’<«>  -  j  ln,rw 

d  x 


To  estimate  the  parameters  Fisher's  scoring  method,  with  the  modified 
step-size  halving  as  before,  is  used. 


VI.  Results 

Using  the  nine-point  fixed-state  method  in  Component  31  of  the 
CADA  monitor,  in  which  N  =  7,  fourteen  indifference  probabilities 
were  collected  for  seventeen  subjects  (K  =  14).  The  outcome  used  in 
the  assessment  procedure  was  the  GPA  and  the  subjects  were  the  students 
of  The  University  of  Iowa.  For  each  subject,  six  utility  scales  based 
on  the  different  criteria  were  estimated.  In  order  to  see  the 
differences  of  the  utility  scales  the  interscale  correlation 
coefficients  (CC)  and  the  sums  of  the  interscale  absolute  differences 
(SAD)  were  calculated  for  each  subject. 

Generally  speaking,  the  differences  were  very  little  for  all  the 
subjects.  Among  seventeen  subjects,  the  minimum  of  the  mean  correlation 
coefficients  (MCC)  over  six  scales  was  .96732  and  the  maximum  of  the 
mean  of  the  sums  of  the  absolute  differences  (MSAD)  was  .09056,  both 
of  which  were  obtained  for  the  subject  /M.  However,  close 


L 
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examination  shows  that  while  the  differences  between  the  scale  based 
on  the  beta  distribution  with  b  =  1  and  the  other  scales  are  fairly 
large,  the  differences  among  the  other  scales  are  very  small  for  this 
subject,  which  may  indicate  that  the  beta  scales  with  b  =  1  converged 
at  the  local  minimum  for  the  subject  #4. 

The  minimum  of  MCC  and  the  maximum  of  MSAD  among  the  sixteen 
subjects  excluding  the  subject  #4  was  .99325  for  the  subject  #9  and 
.04912  for  the  subject  //3  respectively.  The  estimated  utility  scales 
and  the  95  percent  approximate  HDR  for  the  subject  It 3  are  shown  in 
Fig.  1  and  Fig.  2.  As  indicated  above,  the  differences  of  the  six 
utility  scales  are  the  largest  for  this  subject  except  subject  It 4. 

For  all  the  other  fifteen  subjects  the  utility  scales  and  their  HDR's 
were  very  similar  to  each  other.  One  of  the  typical  results  (subject 
It 8)  are  shown  in  Fig.  3  and  Fig.  4. 

While  there  seems  to  be  no  consistent  qualitative  differences 
among  the  different  scales,  it  is  clear  that  the  fit  of  the  model  and 
the  interscale  differences  are  negatively  correlated.  That  is,  if  the 
fit  is  good  (lower  sum  of  the  residuals  for  the  normal  models,  or 
higher  likelihood  for  the  beta  models),  the  differences  among  the 
scales  become  smaller.  In  Table  1  the  correlation  coefficients  among 
the  least  squares  criterions  (Q^  through  Q^)  minimized,  the  log 
likelihoods  (with  b  =  0  and  b  =  1),  CC's,  and  SAD's  over  seventeen 
subjects  are  shown.  It  is  also  clear  that  the  fit  and  the  size  of 
the  HDR's  for  the  beta  scales  with  b  =  1  seem  to  be  very  sensitive  to 
the  fit  and  always  resulted  in  the  largest  intervals  when  the  fit  was 
poor.  (The  HDR  for  the  subject  It 3  in  Fig.  2  lies  out  of  the  range  of 


the  graph.) 
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VII.  Discussion 

As  shown  in  the  previous  sections,  for  almost  all  the  subjects, 
the  differences  among  the  utility  scales  based  on  the  different 
criteria  or  distributional  assumptions  are  very  small.  The  fact  that 


the  least  square  solution  under  the  assumption  that  p 


ijk’ 


s  have  the 


constant  variances  all  over  the  range  showing  no  qualitative  differences 


tells  us  that  the  least  squares  solutions  are  very  robust.  While  the 


other  scales  seem  to  be  theoretically  more  sound,  the  least  squares 
solution  based  on  the  probability  itself  might  be  recommended  as  the 


simplest  method.  However,  as  for  the  computations,  there  are  no 
great  differences  among  the  least  squares  solutions,  which  indicates 


the  choice  is  very  arbitrary. 

On  the  contrary,  the  maximum  likelihood  solutions  based  on  the 
beta  distributions  require  a  great  deal  of  effort  to  evaluate  the 
beta  function,  psy  function,  and  the  derivative  of  the  psy  function, 
which  may  affect  the  computational  time  on  small  computers  seriously 
Finally,  we  may  mention  the  reliability  of  the  utility  scales. 
Unlike  the  area  of  the  mental  test  theory,  little  attention  has  been 


paid  to  the  reliability  concept  in  psychological  scaling. 


The  reasons  are: 


1)  In  the  mental  test  theory,  the  observations  and  the  estimated 
scales  are  essentially  the  same  quantity,  which  leads  to  the  variance- 
ratio  definition  of  the  reliability  coefficient. 
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2)  In  the  psychological  scaling,  usually,  the  observations  and  the 
scale  have  a  functional  relation.  For  example,  we  have  to  estimate 
the  utility  scale  through  the  subjective  probabilities.  This  fact 
makes  it  difficult  to  calculate  the  observed  variance. 

If  we  have  the  repeated  observations,  it  is  possible  to  calculate 
the  interscale  correlation  coefficient  among  the  two  scales  based  on 
the  different  sets  of  observations  like  split-half  methods.  On  the 
contrary,  it  is  possible  to  define  the  reliability  of  the  scale  as 
Var(p. 


(28)  P  = 


iiK 


VarCPf.k) 


which  can  be  calculated  without  repeated  observations.  However, 
because  this  quantity  is  defined  on  the  probability,  rather  than 
the  utility,  it  may  not  be  appropriate  to  call  this  the  reliability 
of  the  utility  scale.  Further  study  of  the  reliability  seems  to  be 
necessary. 
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